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Query languages for relational databases 



• Operations on databases: 

- queries: "read" data from the database 

- updates: change the content of the database 

• An update can be seen as a functions that, given a 
database produce another database (without 
changing the schema). 

• A query can be seen as a functions that, given a 
database produce a relation. 
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Query languages for relational databases 



• Foundations can be studied with reference to query languages: 

- Relational Algebra, a "procedural" language, in which the 
data retrieval functions are specified by describing the 
procedure that must be followed in order to obtain the 
result. 

- Relational Calculus, a "declarative" language, in which 
the data retrieval functions describe the properties of the 
results, rather than the procedure used to obtain it. 

- (briefly) Datalog, a more powerful language, which allows 
the formulation of queries that could not be expressed in 
algebra or in calculus. 

• Then, we will study SQL, a practical language (with 
declarative and procedural features) for queries and updates 
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Relational Algebra 



• Relational Algebra is a procedural language, based on 
algebraic concepts. 

• Consists of a collection of operators that 

- are defined on relations 

- produce relations as results and therefore expressions can be 
combined to form complex expressions 

• Operators 

- union, intersection, difference 

- renaming 

- selection 

- projection 

- join (Natural join, Cartesian product. Theta join,...) 
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Union, intersection, difference 



• Relations are sets, so we can apply set operators 

• However, we want the results to be relations (that is, 
homogeneous sets of tuples) 

• Therefore: 

- it is meaningful to apply union, intersection, difference 
only to pairs of relations defined over the same 
attributes (homogeneous sets of tuples). 
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Union 



GRADUATES 



Number 


Surname 


Age 


7274 


Raafat 


37 


7432 


Hegazy 


39 


9824 


Shawky 


38 



MANAGERS 



Number 


Surname 


Age 


9297 


Hegazy 


56 


7432 


Hegazy 


39 


9824 


Shawky 


38 



GRADUATES u MANAGERS 



Number 


Surname 


Age 


7274 


Raafat 


37 


7432 


Hegazy 


39 


9824 


Shawky 


38 


9297 


Hegazy 


56 
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Intersection 



GRADUATES 
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Difference 



GRADUATES 
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A meaningful but impossible union 



PATERNITY 



Father 


Child 


Sherif 

Sherif 

Raouf 

Raouf 


Tamer 

Bahaa 

Maged 

Omar 



MATERNITY 



Mother 


Child 


Faten 

Faten 

Laila 

Nadia 


Tamer 

Bahaa 

Maged 

Omar 



Paternity u Maternity ??? 

• It would be meaningful to execute a sort of union in order to 
obtain all the ‘parent-child’ pairs held in the database, but that is 
not possible. 

• The problem: father and mother are different names, but both 
represent a "parent” 

• The solution: rename attributes 
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Renaming 



• unary operator (has one relation as argument). 

• "changes attribute names" without changing values 

• removes the limitations associated with set operators 

• notation: 

pY^x( r ) 

• Example: 

Pparent^Father(Patemity) 

• if there are two or more attributes involved then 
ordering is meaningful: 

PLocation, Pay <— Branch, Salary^ P^ploySS^ ) 
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Renaming, example 



PATERNITY 



Father 


Child 


Sherif 

Sherif 

Raouf 

Raouf 


Tamer 

Bahaa 

Maged 

Omar 



PParent <- Father(PATERNITY) 



Parent 


Child 


Sherif 

Sherif 

Raouf 

Raouf 


Tamer 

Bahaa 

Maged 

Omar 
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Renaming and union 



PATERNITY 



Father 


Child 


Sherif 

Sherif 

Raouf 

Raouf 


Tamer 

Bahaa 

Maged 

Omar 



MATERNITY 



Mother 


Child 


Faten 

Faten 

Laila 

Nadia 


Tamer 

Bahaa 

Maged 

Omar 



PParent <- Father.(PATERNITY) U p Parent Mother . (MATERNITY) 



Parent 


Child 


Sherif 

Sherif 

Raouf 

Raouf 

Faten 

Faten 

Laila 

Nadia 


Tamer 

Bahaa 

Maged 

Omar 

Tamer 

Bahaa 

Maged 

Omar 
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Renaming and union, with more attributes 



EMPLOYEES STAFF 



Surname 


Branch 


Salary 


Nassar 

Safwat 


Cairo 

London 


4500 

5300 



Surname 


Factory 


Wages 


Hamed 

Ramzy 


Mansoura 

Port-Said 


3300 

3200 



PLocation,Pay 4— Branch, Salary (EMPLOYEES) tJ PLocation, Pay Factory, Wages (STAFF) 



Surname 


Location 


Pay 


Nassar 


Cairo 


4500 


Safwat 


London 


5300 


Hamed 


Mansoura 


3300 


Ramzy 


Port-Said 


3200 
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Selection and projection 



• Two unary operators, in a sense orthogonal: 

- selection for ’'horizontal” decompositions, produces a subset of 
tuples on all the attributes. 



- projection for "vertical" decompositions , produces a result to 
which all the tuples contribute, but on a subset of the 
attributes. 






A B 


C 


Selection 




♦ 






A B 




Projection 






♦ 
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Selection 



• Produce results 

a subset of the tuples (those that satisfy a condition) on 
all the attributes (the same schema as the operand ). 

• Notation: 

tf F ( r ) 

• Semantics: 

— CT F (r) = { 1 1 1 er and t satisfies F} 

— F is a formula like: a (and), v (or), — i (not) 
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Selection, example 



EMPLOYEES 



Surname 


FirstName 


Age 


Salary 


Fawzy 


Monir 


25 


2000 


Kamel 


Sayed 


40 


3000 


Salim 


Nabil 


36 


4500 


Abdel-Aziz 


Flassan 


40 


3900 



a Age<30 v Salary>4000 (EMPLOYEES) 



Surname 


FirstName 


Age 


Salary 


Fawzy 

Salim 


Monir 

Nabil 


25 

36 


2000 

4500 
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Selection, another example 



CITIZENS 



Surname 


FirstName 


PlaceOfBirth 


Residence 


Fawzy 


Monir 


Cairo 


Ismailia 


Kamel 


Sayed 


Cairo 


Cairo 


Selim 


Nabil 


Alex 


Alex 


Abdel-Aziz 


Flasan 


Suez 


Alex 



^ PlaceOfBirth=Residence (CITIZENS) 



Surname 


FirstName 


PlaceOfBirth 


Residence 


Kamel 


Sayed 


Cairo 


Cairo 


Selim 


Nabil 


Alex 


Alex 



Database Systems (P. Atzeni, S. Ceri, S. Paraboschi and R. Torlone) 
Chapter 3 : Relational Algebra and Calculus 



18 




Selection, Precise definition 



Given a relation , a propositional formula F on X is a formula 
obtained by combining atomic conditions of the type A&B or A&c 
with the connectives a (and), v (or), — i (not) where: 

• $ is a comparison operator (=, >, <, <, >); 

• A and B are attributes in X that are compatible (that is, the 
comparison £1 is meaningful on the values of their domains); 

• c is a constant compatible with the domain of A. 

Given a formula F and a tuple t, a truth value is defined for F on t: 
A£1B is true on t if and only if t[A] is in relation £1 with t[B](for 
example, A=B is true on t if and only if t[A] = t[B]); 

• A 0 c is true on t if and only if t[A] is in relation £1 with c; 

• FI v F2, FI a F2 and — i FI have the usual meaning. 

• the selection a F (r) produces a relation on the same attributes as r 
that contains the tuples of r for which F is true. 
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Projection 



• Produce results 

— all the tuples contribute, 

— but on a subset of the attributes. 

• Notation (given a relation r (X) and a subset of 
attributes Y of X): 

7t Y (r) 

• Semantics: 

7i Y (r) = { t[Y] Iter} 
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Projection, example 



EMPLOYEES 



Surname 


FirstName 


Department 


Head 


Fawzy 


Monir 


Sales 


Khamis 


Kamel 


Sayed 


Sales 


Khamis 


Selim 


Nabil 


Personnel 


Marzouk 


Abdel-Aziz 


Hasan 


Personnel 


Marzouk 



^Surname, FirstName (EMPLOYEES) 



Surname 


FirstName 


Fawzy 

Kamel 

Selim 

Abdel-Aziz 


Monir 

Sayed 

Nabil 

Hasan 
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Projection, another example 



EMPLOYEES 



Surname 


FirstName 


Department 


Head 


Fawzy 


Monir 


Sales 


Khamis 


Kamel 


Sayed 


Sales 


Khamis 


Selim 


Nabil 


Personnel 


Marzouk 


Abdel-Aziz 


Hasan 


Personnel 


Marzouk 



^Department, Head (EMPLOYEES) 



Department 


Head 


Sales 

Personnel 


Khamis 

Marzouk 



Equal attributes collapse into a single tuple 

The result contains fewer tuples than the operand, because all the tuples in the 
operand that have equal values on all the attributes of the projection give the 
same contribution to the projection itself. 
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Cardinality of projections 



• The result of a projections contains at most as many 
tuples as the operand 

• It can contain fewer, if several tuples "collapse" 

• 7T Y (r) contains as many tuples as r if and only if Y is a 
superkey for r; 

— this holds also if Y is "by chance" (not defined as a 
superkey in the schema, but superkey for the specific 
instance), see the example 
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Tuples that collapse 



STUDENTS 



RegNum 


Surname 


FirstName 


BirthDate 


DegreeProg 


284328 


Abdel-Aziz 


Fatehy 


29/04/1989 


Computing 


296328 


Abdel-Aziz 


Rashed 


29/04/1989 


Computing 


587614 


Abdel-Aziz 


Sayed 


01/05/1999 


Engineering 


934856 


Kamel 


Sayed 


01/05/1999 


Fine Art 


965536 


Kamel 


Sayed 


05/03/1988 


Fine Art 



^Surname, DegreeProg (STUDENTS) 



Surname 


DegreeProg 


Abdel-Aziz 

Abdel-Aziz 

Kamel 


Computing 
Engineering 
Fine Art 
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Tuples that do not collapse, "by chance" 



STUDENTS 



RegNum 


Surname 


FirstName 


BirthDate 


DegreeProg 


296328 


Abdel-Aziz 


Rashed 


29/04/1989 


Computing 


587614 


Abdel-Aziz 


Sayed 


01/05/1999 


Engineering 


934856 


Kamel 


Sayed 


01/05/1999 


Fine Art 


965536 


Kamel 


Sayed 


05/03/1988 


Engineering 



^Surname, DegreeProg (STUDENTS) 



Surname 


DegreeProg 


Abde-Aziz 

Abde-Aziz 

Kamel 

Kamel 


Computing 
Engineering 
Fine Art 
Engineering 
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Relational Algebra Review 



Basic operations: 

* Renaming (p ) 

* Selection ( a ) 

* Projection ( it ) 

* Difference ( — ) 

* Union ( u ) 
• Intersection ( n ) 



rename the attributes 
gives a subset of rows, 
deletes unwanted columns, 
tuples in relation 1, but not 2 
tuples in relation 1 or 2 or both, 
tuples in both relations. 



Additional operations: 

• Cross-product ( x ) : combine two relations. 

• Join ( [X ) : like x but only keep tuples where common 

fields are equal. 
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Join 



• The most typical operator in relational algebra 

• allows to establish connections among data in 
different relations, taking into advantage the "value- 
based" nature of the relational model 

• Two main versions of the join: 

— " Natural" join: takes attribute names into account 

- " Theta" join 

• They are all denoted by the symbol x 
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A natural join 



• The tuples in the natural join are obtained by combining the tuples 
with equal values on the common attributes. 



R i 







Fawzy 


Sales 


Kamel 


Production 


Badr 


Production 



R 2 







Production 


Ihab 


Sales 


Radwan 



R 1 EX] R 2 









Fawzy 


Sales 


Radwan 


Kamel 


Production 


Ihab 


Badr 


Production 


Ihab 
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Natural join: definition 



• ri (Xj), r 2 (X 2 ) 

• rj x r 2 (natural join of r, and r 2 ) is a relation 
defined on X,X 2 (the union of the two sets): 

{ t on XjX 2 1 t [XJ e r x and t [X 2 ] e r 2 } 

or, equivalently 



{ t on XjX 2 I exist tj e rj and t 2 e r 2 with t [XJ = t x 

and t [X 2 ] = t 2 } 
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Natural join: comments 



• The tuples in the result are obtained by combining tuples in the 
operands with equal values on the common attributes. 

• The degree of the result of a join is less than or equal to the sum 
of the degrees of the two operands, because the common 
attributes of the operands appear only once in the result. 

• The common attributes often form a key of one of the operands; 
there is a referential constraint between the common 
attributes, (remember: references are realized by means of 
keys, and we join in order to follow references). 
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Natural join: example 



Code 


Date 


Officer 


Department 


Registration 


143256 


25/10/92 


567 


75 


5694 FR 


987554 


26/10/92 


456 


75 


5694 FR 


987557 


26/10/92 


456 


75 


6544 XY 


630876 


15/10/92 


456 


47 


6544 XY 


539856 


12/10/92 


567 


47 


6544 XY 



Registration 


Department 


Owner 


Address 


6544 XY 


75 


Cordon Edouard 


Rue du Pont 


7122 HT 


75 


Cordon Edouard 


Rue du Pont 


5694 FR 


75 


Latour Hortense 


Avenue Foch 


6544 XY 


47 


Mimault Bernard 


Avenue FDR 



Offences xi Cars 



Code 


Date 


Officer 


Depart- 

ment 


Regist 

-ration 


Owner 


Address 


143256 


25/10/92 


567 


75 


5694 FR 


Latour Hortense 


Avenue Foch 


987554 


26'' 10/92 


456 


75 


5694 FR 


Latour Hortense 


Avenue Foch 


987557 


26/10/92 


456 


75 


6544 XY 


Cordon Edouard 


Rue du Pont 


630876 


15/10/92 


456 


47 


6544 XY 


Mimault Bernard 


Avenue FDR 


539856 


12/10/92 


567 


47 


6544 XY 


Mimault Bernard 


Avenue FDR 
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Natural join: Another example 



Paternity 


Father 


Child 


Adam 


Cain 


Adam 


Abel 


Abraham 


Isaac 


Abraham 


Ishmael 



Maternity 



Mother 


Child 


Eve 


Cain 


Eve 


Seth 


Sarah 


Isaac 


Hagar 


Ishmael 



Paternity xi Maternity 



Father 


Child 


Mother 


Adam 


Cain 


Eve 


Abraham 


Isaac 


Sarah 


Abraham 


Ishmael 


Hagai’ 
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Joins can be 



"incomplete" 






• If a tuple does not have a "counterpart /equivalent" in the 
other relation, then it does not contribute to the join 
("dangling" tuple) 







Fawzy 


Sales 


Kamel 


Production 


Badr 


Production 







Production 


Ayman 


Purchasing 


Othman 



Ri ^<] R2 









Kamel 


Production 


Ayman 


Badr 


Production 


Ayman 
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Joins can be 



empty 



• As an extreme, we might have that no tuple has a 
counterpart, and all tuples are dangling 








Fawzy 


Sales 


Kamel 


Production 


Badr 


Production 







Marketing 


Ayman 


Purchasing 


Othman 



Ri XI R ; 
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The other extreme 



• If each tuple of each operand can be combined with all 
the tuples of the other . ; then the join has a cardinality that 
is the product of the cardinalities of the operands 
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How many tuples in a join? 



Given rj (Xj), r 2 (X 2 ) 

• the join has a cardinality between zero and the products 
of the cardinalities of the operands: 

0 < I r l |Xt 2 I < I rj I x | r 2 l 

(I r I is the cardinality of relation r) 

• moreover: 

— if the join is complete, then its cardinality is at least the 
maximum of I I and I r 2 l 

— if XjnXj contains a key for r 2 , then I r, txl 2 1 < I r x l 

— if X t nX 2 is the primary key for r 2 , and there is a referential 
constraint between X , r ■ X 2 in i', and such a key, then 
I r t X r 2 l = I r t l 



Database Systems (P. Atzeni, S. Ceri, S. Paraboschi and R. Torlone) 
Chapter 3 : Relational Algebra and Calculus 



36 




Outer joins 



• A variant of the join, to keep all pieces of information from 
the operands. 

• This allows for the possibility that all the tuples contribute 
to the result, extended with null values where there is no 
counterpart. 

• It "pads with nulls" the tuples that have no counterpart 

• Three variants: 

— “Left" : only tuples of the first operand are padded 

— “Right": only tuples of the second operand are padded 

— “Full": tuples of both operands are padded 
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Outer joins 



R 



i 







Fawzy 


Sales 


Kamel 


Production 


Badr 


Production 







Production 


Ayman 


Purchasing 


Othman 



I^^LEFT^2 



RjM 



RIGHT^2 
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N-ary join 



• The natural join is 

- Commutative: R, X R 2 = R 2 X R, 

- Associative: (R 3 XI R 2 ) CXI R 3 = R[ IXI (R 2 X R 3 ) 

• Therefore, we can write n-ary joins without ambiguity: 

R, X R 2 X ... XR n 
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N-ary join 



R, 




r,IX|r 2 IX| r 3 











Kamel 


Production 


A 


Ayman 


Ahmed 


Marketing 


B 


Othman 


Badr 


Production 


A 


Ayman 
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Cartesian product 



• The Natural join is defined also when the operands have 
no attributes in common 



• in this case no condition is imposed on tuples, and 
therefore the result contains tuples obtained by combining 
the tuples of the operands in all possible ways 
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Cartesian product: Example 



EMPLOYEES 



Employee 


Project 


Fawzy 


A 


Ahmed 


A 


Ahmed 


B 



PROJECTS 



Code 


Name 


A 


Adel 


B 


Mahmoud 



EMPLOYES x PROJECTS 



Employee 


Project 


Code 


Name 


Fawzy 


A 


A 


Adel 


Ahmed 


A 


A 


Adel 


Ahmed 


B 


A 


Adel 


Fawzy 


A 


B 


Mahmoud 


Ahmed 


A 


B 


Mahmoud 


Ahmed 


B 


B 


Mahmoud 
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Theta-join 



• In most cases, a Cartesian product is meaningful only if 
followed by a selection: which preserves only the combined 

tuples that satisfy the given requirements. 



• For example, it makes sense to define a Cartesian product on the 
relations EMPLOYEES and PROJECTS, if it is followed by the 
selection that retains only the tuples with equal values on the 
attributes Project and Code. 

• For this reason, another operator is often introduced, the theta- 
join. 
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Theta-join 



— Theta-join: a derived operator, defined by means of other 
operators, it is a Cartesian product followed by a selection. 

R] X p R2 = a P (Ri IXIR2) 



> A theta-join in which the condition of selection F is a 
conjunction of atoms of equality, each with an attribute of the 
first relation and one of the second, is called an Equi-join. 
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Equi-join: example 



EMPLOYEES 



Employee 


Project 


Fawzy 


A 


Ahmed 


A 


Ahmed 


B 



PROJECTS 



Code 


Name 


A 


Adel 


B 


Mahmoud 



EMPLOYES Xp roject=Code PROJECTS 



Employee 


Project 


Code 


Name 


Fawzy 


A 


A 


Adel 


Ahmed 


A 


A 


Adel 


Ahmed 


B 


B 


Mahmoud 
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